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The derivation of the expressions of momentum and energy of a particle in special relativity is 
often less than satisfactory in elementary texts. In some, it is obtained by resorting to quantum or 
electrodynamic considerations, in others by introducing less-than-elementary concepts, like that of 
a four-vector, or even misleading ones, like “relativistic mass”. Nevertheless it is possible, following 
ideas described by Einstein in 1935, to obtain a fully elementary derivation of these expressions based 
only on the Lorentz transformations, on the conservation laws, and on the Newtonian limit. The 
resulting argument allows for a clearer and logically consistent introduction to the basic concepts of 
relativistic dynamics. 


I. INTRODUCTION 

Several texts provide an elementary derivation of the 
kinematics of special relativity, eventually based on the 
first part of Einstein’s fundamental paper of 1905.® Start¬ 
ing from the two postulates of the total equivalence of in¬ 
ertial reference systems, and of the constancy of the speed 
of light in all such systems, it is in fact easy to obtain 
the expression of the Lorentz transformation of space- 
time coordinates. A particularly simple and appealing 
derivation is obtained by exploiting Bondi’s so-called k- 
calculus,® itself based on the Doppler effect. Neverthe¬ 
less going from kinematics (the Lorentz transformation) 
to dynamics, and in particular to the relativistic expres¬ 
sions of momentum and energy, is often achieved by re¬ 
sorting to more sophisticated concepts, like that of a four- 
vector, or by the use of quantum considerations. (One 
example of this approach is the “elementary derivation” 
suggested by F. Rohrlich,® which exploits the expressions 
of the momentum and energy of a photon of frequency v.) 
Einstein himself had originally derived the mass-energy 
equivalence via an explicit use of electrodynamics,® and 
not just by the kind of kinematic considerations which he 
had developed in the first part of his 1905 paper, which 
are based on the constancy of the speed of light, but do 
not otherwise depend on Maxwell’s equations. 

Einstein had remarked this problem, and he proposed 
in 1935 an elementary derivation of the mass-energy re¬ 
lation, independent of his 1905 argument, motivating it 
with the following wordsP 

The special theory of relativity grew out of 
the Maxwell electromagnetic equations. So it 
came about that even in the derivation of the 
mechanical concepts and their relations the 
consideration of those of the electromagnetic 
field has played an essential role. The ques¬ 
tion as to the independence of those relations 
is a natural one because the Lorentz transfor¬ 
mation, the real basis of the special relativity 
theory, in itself has nothing to do with the 
Maxwell theory and because we do not know 
the extent to which the energy concepts of the 


Maxwell theory can be maintained in the face 
of the data of molecular physics. In the fol¬ 
lowing considerations, except for the Lorentz 
transformation, we will depend only on the 
assumption of the conservation principles for 
impulse and energy. 

Einstein’s considerations exploit a conceptual experiment 
introduced by G.N. Lewis and R.C. TolmarP and fur¬ 
ther discussed by P.S. Epstein,® where one considers col¬ 
lisions between pairs of particles in different inertial ref¬ 
erence frames, and one looks for the expressions of mo¬ 
mentum and energy by postulating their conservation. It 
is interesting to point out that Lewis and Tolman’s pa¬ 
per, as well as Epstein’s one, only consider elastic colli¬ 
sion and derive the relativistic expression of momentum, 
while they provide a doubtful argument for the mass- 
energy equivalence by the consideration of the change of 
the “relativistic mass” with speed. “Relativistic mass” 
is in fact a rather problematic concept,®^ and there is a 
growing consensus to avoid its introduction in the teach¬ 
ing of relativity. Einstein derives instead the equivalence 
by simply extending the argument to inelastic collisions. 
The advantage of this approach for introducing the ba¬ 
sic concepts of special relativity has been well remarked 
by R.F. Feynman who, in his Lectures (Ref. [101 Vol. I. 
Secs. 16-4, 16-5), derives the relativistic expressions of 
momentum and energy in a way that closely resembles 
Einstein’s one. (Feynman’s derivation is however marred 
by his use of the “relativistic mass”.) Einstein’s argu¬ 
ment has been more recently discussed by F. Flores,^ 
who identifies three closely related but different claims 
within the mass-energy equivalence concept, and com¬ 
pares Einstein’s 1935 argument with his original 1905 
derivatiorP and with M. Friedman’s 1983 derivation,!^ 
(p. 142ff) which rests upon the consideration of Newton’s 
equations in special relativity. 

In this note, I present this line of thought in the hope 
that it may be found useful for the presentation of these 
fundamental concepts of special relativity in introduc¬ 
tory courses for students of physics and mathematics. 
While the derivation of the relativistic expression of mo¬ 
mentum in Sec. im] is close to Epstein’s and Feynman’s 
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arguments, the discussion of the expression of the kinetic 
energy and of the mass-energy equivalence is closer to 
Einstein’s one. 


II. LORENTZ TRANSFORMATIONS AND 
DYNAMIC POSTULATES 

Following Einstein’s 1905 paper (Ref. [U § 2), the kine¬ 
matic concepts of special relativity rest on the following 
postulates: 

(i) The laws that govern the transformations of the 
state of physical systems take the same form in ref¬ 
erence frames animated by uniform translational 
motion one with respect to the other. 

(ii) There is a class of such reference frames (the iner¬ 
tial frames) in which the speed of light assumes the 
same value c, independently of the state of motion 
of its source. 

We shall choose from now on units in which c = 1 and 
limit our considerations to inertial frames. Based on 
these postulates one can easily derive Lorentz transfor¬ 
mations in the following form. Let us consider two ref¬ 
erence frames, K and K', such that K' is in uniform 
translational motion in the positive x direction and with 
speed V with respect to K. Then the event of coordinates 
{t',x',y', z') in K' has in K the coordinates {t,x,y,z), 
where 

t = j{v){t' + vxy, 

X = j{V){x +Vt'); 
y = y'] 

^ — J 
Z — Z ^ 

and we have defined 

^ (1 _ t/2)l/2- (2) 

The same relation holds for the differentials dt, dx, etc. 
Dividing by dt we obtain the rules for the transformation 
of velocities: 

dx u'^-\-V 

^ dt 1-1- u'y/ ’ 

dt yv){i + KV)^ 

dz u' 

■ji = _ = ^ 

^ dt yv){i + u',vy 

To introduce dynamical concepts we obviously need 
supplementary postulates. We shall therefore postulate 
the following: 

(iii) The momentum P and the energy if of a particle 
possessing the velocity u in the reference frame K 
have respectively the expressions 

P = muF{u); E = Eq mG{u), (4) 


where Eq is a constant, that can be called the rest 
energy^ m is a positive constant (which is relativis- 
tically invariant, i.e., does not change from one in¬ 
ertial frame to another, and thus does not depend 
on the particle’s velocity) which we shall simply call 
its mass, and E(u) and G{u) are monotonically in¬ 
creasing universal functions of u = |m|. The fact 
that the functions E and G depend only on the 
magnitude u of the velocity, but not on its direc¬ 
tion, can be inferred by the isotropy of space. 

(iv) For M <C 1 these expressions reduce to the well- 
known classical ones. One has in particular 

F{u) = 1-1- O(m^); G{u) = -I- O (m^) . (5) 

(v) The total momentum P^°^ and the total energy 
E^°^ of a system of several particles are respectively 
given by the sum of P and of E running over all 
the particles of the system. 

(vi) Conservation of momentum and energy: Let 

us assume that (elastic or inelastic) collisions occur 
in a system of particles. Then, in each reference 
frame, and maintain the same values be¬ 
fore and after each collision. 

As a corollary, the velocity of each particle remains con¬ 
stant as long as no collision occurs, because one can con¬ 
sider systems made up of single, independent particles. 

III. ELASTIC COLLISIONS AND 
RELATIVISTIC MOMENTUM 

Let us now consider a particle pair, i.e., a system made 
of two particles with equal values of m. Let us as¬ 
sume that in a reference frame K they have opposite 
velocities Ui, U 2 = —ui, where tti = (V,u, 0), with 
|w| V, V > 0. Thus |mi, 2 | — V. Let us moreover 
assume that the particles undergo an elastic collision, 
and take on respectively the velocities = {W, w, 0) 
and w -2 = {W',w',0) after it. By the conservation of 
momentum one must have W' = —W and w' = —w, in¬ 
dependently of the form of the function F{u). Indeed, one 
has Pfy = 0 before the collision. By the conservation 
law one must have 

P^°l = 0 = mwi F{wi) -\- mw2F{w2)- (6) 

Thus the vectors Wi and W 2 are antiparallel, and one has 

\wi\ ^ F{w2) , . 

\W 2 \ F{wi)' 

Since the function F{u) is monotonically increasing, this 
equation can only be satisfied if = \w 2 \, and we 
have therefore Wi = —W 2 . Now energy conservation im¬ 
poses w± = u±. Indeed, the total kinetic energy be¬ 
fore the collision is given by 2mG{u), and after it is 
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given by 2mG(w). Since G{u) is a monotonically in¬ 
creasing function of u, this condition can only be satisfied 
if w± = u. 

Let us now consider the special case where the velocity 
change is parallel to the y axis. (This rules out “head- 
on collisions”, where the particles simply exchange their 
velocity.) Let the particles’ velocities be given by u[ — 
and 1 x 2 = —u[ (cf. fig. in the K' reference 
frame. Let us now look at the same collision in a reference 
frame K moving with a velocity V' in the x direction with 
respect to K'. In this reference frame, the velocities u± 



FIG. 1. Above: Collision of a particle pair in the K' reference 
frame. Below: The same collision as seen in the K reference 
frame. Figures are schematic and no quantitative relations 
are implied. 


and therefore that 


V = w 7(1^). (II) 

One can also obtain this result by applying Eq. ^ to the 
transformation from the K' reference frame to a frame 
K" animated, with respect to K', by a uniform transla¬ 
tional motion with velocity —V parallel to the x axis. In 
this frame the x component of the velocity M 2 vanishes 
and that of the m" velocity is equal to E, and thus the 
speeds of the two particles are interchanged. 

Let us now consider the conservation of momentum. 
The change bP\ of the momentum of particle 1 is given 
by 


6P\ = —2mvF{v)ey, (12) 

where Ey is the y axis versor. The corresponding quantity 
for particle 2 is given by 

SP 2 = 2mw F(U 2 ) By, (13) 

where U 2 = vF. Let us momentarily assume 

v,w V: then F{v) ~ 1 and U 2 — V. From momentum 
conservation we obtain SPi -I- dP 2 = 0, which implies 


mv = mw F(y). 


(14) 


Since w = v/j{V), we obtain 

F{V)='y{V) = , ^ 


(15) 


Having obtained this result for u, w <C H, it is easy to 
see that it also holds for larger values of v and w, by sub¬ 
stituting V with the speed of the corresponding particle. 
We have in fact 


mvF(v) = mw F{u 2 ) 


mv 

7 ( 1 /) 


F{u2), 


from which it follows 


(16) 


of the particles are respectively given by 

Ml = (0,w, 0), 

U2 = {-V, -w,0). 


where 


V = 


2V' 


l + V 


2 ’ 


V = 


v' 

w = - 

j{V'){l + VF) 

One can easily check that 

7(H) = ^ ^ + 


Vl - 1/2 1 _ 


F{u 2 ) = F{vh{V), 


( 8 ) 


namely 

I _ I I 

v/1 - ~ 

an identity which is easy to check directly. 


(17) 


(18) 


IV. KINETIC ENERGY CONSERVATION 

Let us consider a particle having the velocity u' = 
{u', 0, 0), parallel to the x axis, in the K' reference frame. 
Its velocity u in the K frame, in uniform translational 
motion with respect to K' with speed V in the direction 
of the positive a;-axis, is given by m = (it, 0,0), with 


( 10 ) 


u' + V 
l + u'V 


(19) 
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One can easily see that 


V. MASS-ENERGY EQUIVALENCE 


'y{u) = {I + u’V)^{u’)^{V). (20) 

If u' is not parallel to the x axis, but one has instead 
u' = (it(,, Uy, ), one has the more general relation 

7(|m|) = (1 + <U)7(u )7(^), (21) 

which can be obtained with a little algebra. It is also 
easy to check that 

Uxliv) = « -f V)j{u')^{V); 

Uyj{u) = Uyj{u')-, ( 22 ) 

Uz7(u) = u'^jiu'). 

Let us now consider a particle pair, which have op¬ 
posite velocities u[, u'^ = —u\ in the K' frame. Thus 
u' = \u\\ = is the common value of the particles’ 
speed in the K' frame. Let us denote by Ui and U 2 the 
corresponding velocities in the K frame. We obtain 


Let us now consider a totally inelastic collision in a 
particle pair. In the reference frame K' in which = 
0, the total kinetic energy before the collision is given by 

7i; = 2m(7(n')-l), (29) 

For simplicity, we shall assume that the particles veloc¬ 
ities u'l 2 are parallel to the y axis. The total kinetic 
energy after the collision vanishes, but the energy of the 
resulting particle has increased by Tf^. By momentum 
conservation the resulting particle is at rest in the refer¬ 
ence frame K' and has thus velocity V parallel to the x 
axis in the K frame. Let us denote by M its mass. In 
the K frame the total momentum before the collision is 
given by 

P = rn (i 6 i 7 (ui) -I- ^ 27 (^ 2 )) = 2mV')(u')^(V), (30) 

while after the collision it has the value 

P = MVj{V). (31) 


7 (ui) -b 7(ii2) = 2"/{u')j(y). (23) 

We have seen that an elastic collision in the K frame can¬ 
not change the common value u' of the particles’ speed. 
Thus the right-hand side of this equation cannot change 
in the collision. But then neither can its left-hand side. 
If we denote by Wi and W 2 the particles’ speeds in the K 
frame after the collision, we obtain 

'r{ui)+j{u2)=jiwi)+j{w2). (24) 

As Einstein (Ref. 0 P- 227) points out, these equa¬ 
tions have the form of conservation laws. We can thus 
interpreiP^ m ( 7 (u) — I) as the kinetic energy T of a par¬ 
ticle with mass m animated by a velocity u. For small 
values of u this quantity is thus given by 

T = m — 1) ~ , (25) 

in agreement with the classical limit. We can thus set 

G{u) = 'y(u) — 1. (26) 


We obtain therefore 

M = 2mj(u') = 2m + (32) 

Therefore when two particles collide inelastically to form 
a new particle, the mass of the resulting particle is larger 
than the sum of the masses of the colliding particles by a 
quantity exactly equal, in our units, to the kinetic energy 
transformed by the collision into other energy forms. One 
can thus introduce a “natural” choice of the energy at 
rest Eq of a particle, by equating it (in our units) to 
its mass, also because, “from the nature of the concept, 
[that] is determined only to within an additive constant, 
one can stipulate that Eq should vanish together with 
m.'^ (p. 229). We can then identify m'y{u) with the 
total energy of a particle, and associate the change 6E of 
energy from the kinetic to a different form with a change 
6m = SE of the mass of the particle. In the more general 
case, in which the two particles do not coalesce, but are 
animated by velocities with the same magnitude w' in 
the K' frame after the collision, one can then express the 
conservation of energy by 

2m^(u) = 2m'j{w'), (33) 


Let us remark moreover that, by applying Eqs. (22) to a 
particle pair, we obtain 


Uijiui) + U 2 "t{u 2 ) = 2V^{u')jiV). (27) 


where fh is the mass of each particle after the collision. 
The same applies in the K frame, if we take into consid¬ 
eration the relation 

7(u) = 7 (u')7(^), (34) 


We can thus derive the following relation: 

Ml7(ui) + U2j(u2) = + W2j(w2), (28) 

which can be interpreted as the conservation law for the 
momentum. We thus recover the relativistic expression 
of the momentum derived in sec. IIIII 


which holds since both Ui and U 2 are parallel to the y 
axis. 

It is then a simple matter to verify that the energy E 
and the momentum P of a particle of mass m satisfy the 
relation 


77^2 77)2 _ 2 

L — F = m , 


( 35 ) 
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where the right-hand side is relativistically invariant. 
This relation also holds in the limit of a zero-mass parti¬ 
cle, for which it assumes the form 

E=\P\. (36) 


of the relativistic expression of momentum, contained in 
the book by G.D. Birkhoff and R.E. Langer, Relativity 
and Modern Physics^^ exactly for this reason, as made 
clear by the closing paragraphs of his 1935 paper.^^ 


As a corollary, mass is not additive: the mass of a system 
of particles depends on its total energy content in the 
vanishing-momentum frame. Thus when light waves (of 
vanishing mass) with opposite momenta are emitted from 
a particle at rest, the mass of the particle changes (as 
argued in Einstein’s 1905 work. Ref. HI). Since the right- 
hand side of Eq. (35) is invariant, it can be taken as 
the starting point to show that the energy E and the 
momentum P combine into a relativistic four-vector, a 
property which generalizes to particle systems. 

One should point out that this line of thought does not 
rest on Maxwell’s equations (keeping only the constancy 
of the speed of light) and neither does on other mechan¬ 
ical concepts, in particular on the concept of force, that 
is difficult to justify in special relativity. Einstein crit¬ 
icizes the use of the concept of force in the derivation 


Thus, in the book just mentioned, essen¬ 
tial use is made of the concept of force, which 
in the relativity theory has no such direct sig¬ 
nificance as it has in classical mechanics. This 
is connected with the fact that, in the lat¬ 
ter, the force is to be considered as a given 
function of the coordinates of all the parti¬ 
cles, which is obviously not possible in the 
relativity theory. Therefore I have avoided 
introducing the force concept. 

Eurthermore, I was concerned with avoid¬ 
ing making any assumption concerning the 
transformation character of impulse and en¬ 
ergy with respect to a Lorentz transforma¬ 
tion. 
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